The TASX-environment: an XML-based toolset for time aligned speech corpora

نویسندگان

  • Jan-Torsten Milde
  • Ulrike Gut
چکیده

This paper describes the design and implementation of an XML-based corpus environment for multi-tier annotated speech data. The TASX-environment (TASX: Time Aligned Signal data eXchange format) constitutes the technical basis for a corpus designed to explore the acquisition of prosody by second language learners. It supports all aspects of the corpus setup procedure: XML-based annotation of the speech data, all transformation of non XML-annotations, and the web-based analysis and dissemination of the data.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Prosodic Corpus of Non-Native Speech

The paper describes the design and implementation of an XML-based corpus environment for prosodically annotated data. The TASX-environment (TASX: Time Aligned Signal data eXchange format) constitutes the technical basis for a corpus designed to explore the acquisition of prosody by second language learners. It supports all aspects of the corpus setup procedure: XML-based annotation of the speec...

متن کامل

The TASX-environment: an XML-based corpus database for time aligned language data

The paper describes the design and implementation of an XML-based corpus environment for time aligned language/signal data. The TASX-environment constitutes the technical basis for a phonetic corpus designed to explore the acquisition of prosody by second language learners.

متن کامل

Querying Annotated Speech Corpora

This paper is concerned with querying annotated speech corpora. A growing number of such corpora is currently being created worldwide; however, their usefulness for a wider research community is restricted by the lack of standard tools for creating, editing, annotating, storing and querying them. Two solutions for these problems are presented here: the XML-based data format TASX for corpus crea...

متن کامل

Multimodale bilinguale Korpora gesprochener Sprache: Korpuserstellung, -analyse und -dissemination in der TASX-Umgebung

Zusammenfassung: Dieser Beitrag beschreibt die TASX-Korpusumgebung, ein XMLbasiertes System zur Erstellung und Auswertung von großen Sprachkorpora. Das Time Aligned Signal Data Exchange Format (TASX) wurde speziell entwickelt für die Annotation zeitlich geordneter, multimodaler Sprachdaten. Anhand zweier exemplarisch vorgestellter multimodaler, multilingualer Korpora gesprochener Sprache werden...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002